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Abstract 

We suggest a new color distance based on two observations. First, percep- 
tual color differences were designed to be used to compare very similar colors. 
They do not capture human perception for medium and large color differences 
well. Thresholding was proposed to solve the problem for large color differences, 
i.e. two totally different colors are always the same distance apart. We show that 
thresholding alone cannot improve medium color differences. We suggest to alle- 
viate this problem using basic color terms. Second, when a color distance is used 
for edge detection, many small distances around the just noticeable difference may 
account for false edges. We suggest to reduce the effect of small distances. 

1 Introduction 

Color difference perception of just notably different colors and defining distances be- 
tween very-similar colors has received considerable work l22l |29l |2"T1 l35l l46l . In 
the CIE (International Commission on Illumination) community, distances of up to 
7 CIELAB units, where 1 CIELAB unit approximately correspond to 1 just notice- 
able difference, are considered medium distances fTTl . In this paper we refer to 0-7 
CIELAB distances as very-similar as they capture just a small fraction of similar col- 
ors. See Fig.Q] 

The CIEDE2000 color difference is considered the state of the art perceptual color 
difference EO @7] ED . CIEDE2000's recommended range for use is to 5 CIELAB 
units J2). The COM dataset was used to train and perceptually test the CIEDE2000 
color difference. More than 95 percent of the distances between color pairs in the 
COM dataset are below 5 CIELAB units apart. 

It was pointed out that the resulting color differences do not correspond well with 
human perception for medium to large distances. Rubner et al. 0321 and Ruzon and 
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Euclidean distance to blue in L*a*b* space 




Figure 1 : This figure should be viewed in color, preferably on a computer screen. 
The x-axis is colors image, where the colors are sorted by their Euclidean distance in 
L*a*b* space to the blue color. We can see that distances up to 7 CIELAB units capture 
just a small fraction of similar colors (zoom in to see in the left a black line touching 
the y-axes in 7 CIELAB units). 

Tomasi ll33l used a negative exponent on the color difference. Namely, all totally 
different colors are essentially assigned the same large distance. Pele and Werman l27l 
noted that a negative exponent changes the values in the small distance range. Pele and 
Werman ||27l and Rubner et al. H321 observed a reduction in performance due to this 
change. Pele and Werman suggested thresholding the color difference as it does not 
change the small distances. Thresholding color distances is justified by the fact that if 
people are directly asked for a judgment of the dissimilarity of colors far apart in color 
space, subjects typically find themselves unable to express a more precise answer than 
"totally different" ifTHl . An additional advantage of thresholding color distances is that 
it allows fast computation of cross-bin distances such as the Earth Mover's Distance 
11271 or using the transformation to a similarity measure of one minus the distance 
divided by the threshold, the Quadratic-Chi l28l . 

This paper shows that CIEDE2000 is not a good distance for the medium range 
and using any monotonic function of CIEDE2000 (including a thresholding function) 
cannot solve the problem. For example, using a thresholding function we cannot make 
DarkSkyBlue be more similar to Blue than to HotPink. See Fig.|2]for more examples. 

We suggest an improvement based on basic color terms. Specifically we use Berlin 
and Kay's eleven English basic color terms[6|. However, the generalization to other 
color terms is straight forward. We suggest adding to color differences the distance 
between their basic color terms probability vectors. As basic color terms are correlated 
(e.g. red and orange) we suggest using a cross-bin distance for these probability vectors. 
That is, a distance which takes the relationships between bins (each bin represents a 
basic color term) into account. Specifically we use the Earth Mover's Distance l32l 
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Figure 2: This figure should be viewed in color, preferably on a computer screen. 
Each sub-figure (a)-(d) contains a pair of similar colors and a pair of different colors. In 
all of these examples, the CIEDE2000 distance between the visually similar colors is 
higher than the distance between the different colors. Our proposed distance succeeds 
in all of these examples. 



as it was used successfully in many applications (e.g. ji32,|3Tl[33]|27] and references 
within). 

The probability vectors are obtained with the color naming method developed by 
van de Weijer et al. [44|. Other methods for color naming such as |9] [19] [34j [3] [T6] 
l26l |4j [24] [23] [3 can also be used. We chose van de Weijer et al. method as it has 
excellent performance on real-world images and as the code for it is publicly available. 
However, CIEDE2000 was learned under calibrated conditions, while van de Weijer et 
al. method was learned from natural images. Thus, other color naming methods might 
produce better results. This is left for future work. 

Our proposed solution is not equivalent to increasing the weight of the hue com- 
ponent in the color difference. Color names are not equivalent to hue. For example, 
although a rainbow spans a continuous spectrum of colors, people see in it distinct 
bands which correspond to basic color terms: red, orange, yellow, green, blue and pur- 
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Figure 3: This figure should be viewed in color, preferably on a computer screen. Use 
the pdf viewer's zoom to see the colors. We show colors, sorted by their distance to 
the color on the left. COL DIST is our new perceptual color difference. Several observa- 
tions can be derived from these graphs. First, our distance is perceptually better in the 
medium distance range. Note that the group of similar colors (left side of each color 
legend) is more similar to the color on the left using our distance. For example, in 
the top light blues are similar to blue, while using CIEDE2000 they are very different 
(thus appear on the right). It should be noted that our distance uses a sigmoid function, 
so that very similar colors on the left are essentially assigned the same small distance 
and totally different colors on the right are essentially assigned the same large distance. 
Finally, although our distance is perceptually more meaningful, it is still far from being 
perfect. 



pie. In addition, some basic color terms are not different in their hue component, e.g. 
achromatic colors such as white, gray and black or orange and brown which shares the 
same hue. 

A second problem that occurs when using color differences for edge detection is 
that many small distances around the just noticeable difference may account for false 
edges. We suggest to use a sigmoid function to reduce the small distances effect. As we 
mentioned before, using a negative exponent function in order to assign to all totally 
different color pairs the same distance reduced performance l32ll27l . We explain this 
by the fact that a negative exponent is a concave function. We show that a convex 
function should be applied to small differences. 

We present experimental results for color edge detection. We show that by using 
our new color difference the results are perceptually more meaningful. 

Our solution is just the first step of designing a perceptual color difference for the 
full range of distances. Our major contribution is raising the problem of current state- 
of-the-art color differences in the small and medium distance range. 

This paper is organized as follows. Sectionals an overview of related work. Sec- 
tion [3] introduces the new color difference. Section [4] presents the results. Finally, 
conclusions are drawn in Section[5] 
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2 Related Work 

MacAdam's j22j pioneering work on chromaticity discrimination ellipses, which mea- 
sured human perception of just noticeable differences led the way to the development 
of the L*a*b* space l29l which is considered perceptually uniform; i.e. for very-similar 
colors, the Euclidean distance in the L*a*b* space corresponds to the human percep- 
tion of color difference well. Luo et al. ED developed the CIEDE2000 color difference 
which is now considered the state of the art perceptual color difference l2Tll47l[T7l . 

Although color is commonly experienced as an indispensable quality in describing 
the world around us, state-of-the art computer vision methods are mostly based on 
shape description and ignore color information. Recently this has changed with the 
introduction of new color descriptors [39] [H |25] |40j 02] [8] [361 . However, although 
color is a point-wise property (e.g. bananas are yellow), most of these features capture 
geometric relations such as color edges. 

Wertheimer [lj suggested that among perceptual stimuli there are "ideal types" 
that are anchor points for perception. Rosch [30] proposed that in certain perceptual 
domains, such as color, salient prototypes develop non arbitrarily. An influential paper 
by Berlin and Kay [6 | defined basic colors as color names in a language which are ap- 
plied to diverse classes of objects and whose meaning is not subsumable under one of 
the other basic color names and which are used consistently and by consensus by most 
of the speakers of the language. In their pioneering anthropological study, they found 
that color was usually partitioned into a maximum of eleven basic color categories of 
which three were achromatic (black, white, grey) and eight chromatic (red, green, yel- 
low, blue, purple, orange, pink and brown). This partitioning was a universal tendency 
to group color around specific focal points as was conjectured by Wertheimer JH and 
Rosch |3Q| . 

Considerable work has been carried out in the field of computational color nam- 
ing, see e.g. flU [J9j [34] [3] [16J |26] g] EU |23l [5] SU and references within. Recently 
van de Weijer et al. |44l presented a new color naming method based on real-world 
images. The color names are Berlin and Kay's (6) eleven English basic color terms. 
Van de Weijer and Schmid [43 1 showed that a color description based on these color 
names outperforms descriptions based on photometric invariants. The explanation is 
that photometric invariance reduces the discriminative power of the descriptor. 

Inspired by van de Weijer and Schmid's work we suggest using the basic color 
names to correct the state-of-the-art color difference, CIEDE2000 in the medium dis- 
tance range. 

3 COL DIST : The New Color Difference 

Given two colors C 1 = [R 1 , G 1 , B 1 ] and C 2 = [R 2 ,G 2 , B 2 }, we first convert them 
intoL*a*b* : S 1 = [L 1 ,a 1 ,& 1 ] and S 2 = [L 2 ,a 2 ,b 2 ]. Second, we compute the basic 
color term probability vectors: P 1 , P 2 ; where P" is the probability that the color C" 

1 We used Matlab's default conversion which uses CIE illuminant D50 as the reference illuminant, known 
as the white point, which is also the default illuminant specified in the International Color Consortium spec- 
ifications. 
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is the basic color term i (i.e. black, blue, brown, grey, green, orange, pink, purple, red, 
white or yellow). These probability vectors are computed using the van de Weijer et al. 
color naming method P4l . Now each color C n is represented by an 14-dimensional 
vector: V n = [S n ,P n ] = [L n , a n , b n , Ff, . . . , Pft]. The distance between the two 
colors (parameterized with T, D, a and Z) is defined as: 



, g2 = m in(CIEDE2000(^ 1 ,5 2 ),r) (J) 

d 2 (P\P 2 ) =EMD(P 1 ,P 2 ,L>) (2) 
d 3 (V 1 ,V 2 ) = adi + (l-a)d 2 (3) 

COL msT (V\ V 2 ) = - j— — (4) 

In Eq.Q] di is a thresholded and scaled CIEDE2000 color difference. We threshold 
it as it is recommended for use only for small distances (2). We used T = 20 as was 
used in Pele and Werman l27l . We divide by T so that d\ is between and 1. 

In Eq. |2] c?2 is the distance between the two basic colors probability vectors. As 
the bins in the eleven basic colors probability vectors are correlated (e.g. orange and 
red), we use the Earth Mover's Distance that takes this correlation into account. The 
correlation is encoded in D which is an 11 x 11 matrix, where D^ is the distance 
between basic color term i to basic color term j. We estimated D using the joint 
distribution of the basic color terms. That is, given the matrix M of all probability 
vectors for the colors in the RGB cube (2 15 x 11 matrix, as each dimension of the RGB 
cube was quantized with jumps of 8 l44l ) we define D = as: 



„ minfi),-,-, t) 

= — (6) 

We threshold Dij as EMD is recommended to be used with thresholded ground 
distances ([27) . We used the threshold t = 0.7. Finally we scale it so that < Dij < 1 
(which implies < di < 1). The resulting matrix (see Fig. 0]i is perceptually plausible; 
i.e. similar basic color terms are: grey and white, grey and black, orange and red, etc. 

The Earth Mover's Distance (EMD) ll32ll is defined as the minimal cost that must be 
paid to transform one histogram into another, where there is a "ground distance" (that 
is, the matrix D) between the basic features that are aggregated into the histogram. 
Here the basic features are the eleven English basic color terms. The formula for the 
EMD between the two probability vectors P 1 and P 2 is defined a@ 



2 This is a simplification of the original definition for the case where the histograms are probability vec- 
tors. 
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Figure 4: The learned basic color terms ground distance matrix. 



EMD(P\ P 2 , D) = min ^ s.t 

X] ^ = P i ' X! = P i ' (7) 

j * 

In Eq.[3] d3 is a linear combination of rfi and di- We used a = |. 

Finally, in Eq.|4l the distance is scaled so that it is in the range [— ■§■] (we used 
Z = 10) and then the logistic function (a sigmoid function) is finally applied. The 
sigmoid function reduces the effect of small distances and essentially gives all totally 
different colors the same distance. 



4 Results 

In this section we present color edge detection results. We used Ruzon and Tomasi 
generalized compass edge detector |33| . We used this method for two reasons. First, 
the code is publicly available. Second, the code uses only color cues for the edge 
detection which enables us to isolate color difference performance. 

Ruzon and Tomasi's method [33] divides a circular window around each pixels in 
half with a line segment. Then it computes a sparse color histogram (coined signature 
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in their paper) for each half and computes the Earth Mover's Distance (EMD) l32l 
between the two histograms. The EMD uses a ground distance matrix D between the 
colors. Ruzon and Tomasi converted the images to L*a*b* and then used a negative 
exponent of the Euclidean distance as the ground distance between colors: 



d e (S 1 ,S 2 ) = l-e— V m (8) 

Ruzon and Tomasi used 7 = 14 in their experiments. We compare the edge de- 
tection results using this distance to our proposed COL DIST . We compared also to d\ 
which is a thresholded CIEDE2000 distance as was used by Pele and Werman for im- 
age retrieval 11271 . We also tried our proposed COL DIST without the sigmoid function or 
without the color correction (a = 1) or without the CIEDE2000 term (a = 0) but the 
results using COL DIST were the best. Results are presented in Figs. I5I6I7I8I The results 
show that the new color difference is able to detect color edges much better than the 
state of the art. The resulting edge maps are much cleaner. See figures captions for 
more details. 



5 Conclusions 

We presented a new color difference - COL DIST and showed that it is perceptually more 
meaningful than the state of the art color difference - CIEDE2000. We believe that this 
is just the first step in designing perceptual color differences which perform well in the 
medium range. 

It is easy to generalize our method to other color name sets (such as the Russian 
which separates blue into goluboi and siniy). All one needs to do is to calculate the 
ground distance between all color terms. This can be done by using the joint distribu- 
tion of the new set of color terms. In future work it will be interesting to check other 
color naming methods such as (9J [J9l [34l El HS1 Eg] SI [23] El 

A major difficulty of analyzing color images is the illumination variability of scenes. 
Color invariants are often used to overcome this problem. However, Van de Weijer et al. 
l44l |43l showed that invariants are not discriminative enough. For example, invariants 
usually do not distinguish between achromatic colors (black, gray and white). Using 
color constancy or partial normalization algorithms lfl2l [T3l [TBI |4T1 ITT1 [TOl l20l [7] |37) 
which do not necessarily reduce all distinctiveness may partially alleviate this problem. 
This method was used to improve color naming by Benavente et al. J3). 

Color perception is also affected by spatial and texture cues. It will be interesting 
to combine COL DIST with spatial and texture models 1481 1451 l38l 
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(COL DIST ) (IM) 



Figure 5: Edge detection with the generalized compass edge detection l33ll using 
the following color differences: (NE) A negative exponent applied on the Euclidean 
distance in L*a*b* space (used in 11331 ). (TC) A thresholded CIEDE2000 distance 
(used in ||27l for image retrieval). See Eq. Q] (COL DISX ) Our proposed COL DIST . (IM) 
The original image. 

Our result is much cleaner. Note that our method detects the right boundary of the 
basket without detecting many false edges, while in (NE) and (TC) the false edges 
magnitude is larger than the right boundary of the basket. 
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Figure 6: Edge detection with the generalized compass edge detection l33l using 
the following color differences: (NE) A negative exponent applied on the Euclidean 
distance in L*a*b* space (used in ll33l ). (TC) A thresholded CIEDE2000 distance 
(used in ||27l for image retrieval). See Eq. Q] (COL DIST ) Our proposed COL DIST . (IM) 
The original image. 

Our results are much cleaner. Note that on the top the clean detection of the bushes 
boundaries. 
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Figure 7: Edge detection with the generalized compass edge detection l33ll using 
the following color differences: (NE) A negative exponent applied on the Euclidean 
distance in L*a*b* space (used in J33]). (TC) A thresholded CIEDE2000 distance 
(used in ll27l for image retrieval). See Eq. Q] (COL DIST ) Our proposed COL DIST . (IM) 
The original image. 
Our results are much cleaner. 
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Figure 8: Edge detection with the generalized compass edge detection l33l using 
the following color differences: (NE) A negative exponent applied on the Euclidean 
distance in L*a*b* space (used in ll33l ). (TC) A thresholded CIEDE2000 distance 
(used in ||27l for image retrieval). See Eq. Q] (COL DIST ) Our proposed COL DIST . (IM) 
The original image. 

Our results are much cleaner. Note the strong responses on the fur of the bear and in 
the left person on the top T-shirt using (NE) and (TC). Note that the spots around the 
swimmer in all methods are due to successful detection of the water drops. 
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